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Genomic clones are purified DNA segments replicated 
directly from DNA of a source organism. Cenomic clones are 
distinguishable from cDNA clones in that the' latter are 
transcribed from isolated messenger RNA (mRNA). The dis¬ 
tinction is not only opertionally but chemically significant, 
at least where the source organism is eucaryotic. 

Recent advances have led to the realization that eu¬ 
caryotes differ radically from procaryotes with respect to 
the structure and organization of eucaryotic DNA. In most 
eucaryotes., only a minor proportion of total DNA actually 
encodes specific amino acid sequences. Large untranslated 
regions have been known for some time to exist between 
coding segments. More recently, it has been discovered that 
coding segments themselves are interrupted by untranslated 
intervening sequences of substantial length. The biological 
and evolutionary significance of these intervening sequences 
is still a subject for peculation. However, a general 
understanding of the functional mechanics of transcription 
and translation of eucaryotes is beginning to emerge. 

Genes containing intervening sequences, sometimes 
termed introns, are initially transcribed in their entirety 
in the nucleus of the eucaryotic cell. The initial mRNA 
transcripts are then edited or processed by unknown means to 
remove the intervening sequences and produce a coherent, 
uninterrupted mRNA coding sequence, which is transported to 
the cell cytoplasm. Additional mRNA processing steps occur, 
including a "capping 1 ' reaction that modifies the S'-end and 
addition of a poly A sequence at the 3*-end. None of these 
processing steps is observed in procaryotes. For a general 
review of current understanding, see Dulbecco, R., Micriobiol 
Revs . 43, 443 (1979). 

A~practical consequence, from the standpoint of 
recombinant DNA technology, is that genomic clones are not 
directly expressible by procaryotfe9-, although conceivably 
they might be improperly expressed, introns and exons, to 
yield some sort of chimeric protein. Similarly, cloned 
cDNA, which is expressible in procaryotes, may not be 
efficiently expressed, if at all, in eucaryotic hosts. 
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Therefore, where expression by a eucaryotic host is desired, 
the use of genomic clones is deemed to be the preferred tech¬ 
nique. 

Expression by eucaryotic hosta is useful in circum- 
5 stances where the desired protein is initially expressed as 
a precursor protein which itself requires additional 
processing. Such process steps include the specific cleavage 
of the initial translation product or a sequence of such 
steps, addition of glycosyl residues at specific sites, and 
10 incorporation into dimers, tetramers or multimeric associa¬ 
tions with other proteins. Expression in eucaryotic hosts 
permits these steps to be accomplished as part of the 
eucaryotic expression process. In contrast, expression in 
procaryotes frequently requires additional difficult and 
15 time-consuming in vitro steps of low yield to accomplish the 
same result. Alternatively, special modifications of the 
coding sequence must be made prior to transformation of the 
procaryotic host. Insulin, for example, is initially 
expressed as pre-proinsulin, a protein product carrying the 
20 A and B peptides of insulio, an intervening. C peptide, and 
an N-termi'nal signal peptide. The latter peptide permits 
vectorial transport of the protein into the endoplasmic 
reticulum of pancreatic B cells, and is removed during this 
process. The C peptide is believed to function to permit 
25 folding of the protein such that the A and B peptide chains 
are properly juxtaposed for formation of the correct 
disulfide bridges in the mature insulin molecule. Procaryotic 
expression of insulin cONA requires additional in vitro 
steps to carry.out the necessary processing of the expression 
30 product. One prior art strategy has been the complete 
chemical synthesis of sequences coding for the A and B 
peptides, separate procaryotic expression of these peptides 
and i£ vitro joining of the peptides by chemical means 
(Goeddel, D.V., et al, Proc. Nat. Acad. Sci. USA , 106 

35 (1979)* A separate strategy is based upon procaryotic 

expression'' of human proinsulin cDNA with correct joining of 
the disulfide pairs, a consequence of proper folding induced 
by the presence of the C peptide, followed by. iri. vitro.., 
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excision of the C peptide - see our co-'pending European 
Patent application 80303175.4 filed on 11 September 1980. 

In both cases, the normal processing functions of 
the eucaryotic cell are absent in the procaryotic host and 
5 must be supplied by less efficient in vitro methods. Ex¬ 
pression in eucaryotic cells therefore presents advantages 
for production of human insulin. 

ThO human body produces a large variety of peptides 
and proteins which serve a regulatory function. These 
10 include for exiample insulin, growth hormone, ACTH, somato¬ 
statin, beta-endorphin, nerve growth factor, epithelial 
growth factor and the somatomedins. Normally, the production 
of such proteins is carried out by highly specialized cells 
Which produce these substances in small and precisely 
15 controlled amounts. It would be desirable to obtain 

* sufficient quantities of these substances for therapeutic 
purposes, to supplement their missing function and for the 
treatment of deficiency states. However, the feasibility of 
producing such a regulatory protein with the aid of eucaryotic 
20 cells transformed by a specific gene has never been demon¬ 
strated in the prior art. 

Normally, human insulin is produced by the highly 

differentiated B cells of the Islets of Langerhans in the 

pancreas. The .production of insulin is highly regulated, 

25 since it has powerful hormonal effects. Both over-production 

and under-production are harmful to the organism. The same 

is true for other regulatory hormones of the body, including 

the compounds discussed supra . Since somatic cells of the 

organism are believed to carry out the same genetic 

« 

30 information, it is reasonable to expect that special 

regulatory mechanisms exist to prevent production of such 
hormones by all but the highly specialized endocrine cells. 
Productionof insulin or other hormones 'by the. corresponding 
specific cells is not practical due to the fact that 
35 differentiated cells are difficult to culture, or tend to 
lose their special synthetic capabilities after one or two 
generations. However, various stable cultured cell lines are 
known which have been maintained in culture for years. If 




4 0037723 

such cells could bs induced to express either their endo¬ 
genous insulin gene or an - exogenous, cloned insulin gene, 
the problems presented by procaryotic expression would be 
overcome. Even if a stable differentiated cell line could 
5 be found to produce the desired hormone, as occasionally 
occurs with certain tumours, the added flexibility to use 
modified DNA to produce a modified protein provides a unique 
advantage o.f the recombinant DNA approach. 

The process of controlling the- gene expressiqh in eu- 
10 caryotic cells is complex and imperfectly understood (see 
Dulbecco, R., supra ). Virtually nothing at all is known 
about the special processes regulating hormone synthesis. 

The art of transforming eucaryotic cells is in its infancy. 
That transformation can occur is known. Very little is 
15 known about the fate of the transforming DNA t whether it 

becomes integrated in the host chromosome or replicates by 
the use of a transfer vector bearing its own replication 
origin. 

Successful attempts have been made to transform mouse 
20 "L" cells, a well-etablished cell line derived from a mouse 

fibroblast tumour. Kit, S. , et al, Exp. Celi Res ., 31 , 297 
(1963). In that system, the cell strain lacks thymidine 
kinase activity (tk“), but could be transformed to tk + by 
Herpes simplex virus DNA codinq for thymidine kinase. 

25 (Wigler, M. , et al, Cell , 11_, 223 (1977) ). Further studies 
showed that co-transformatTon with thymidine kinase DNA and 
DNA from bacteriophage 0X 174, plasmid pBR 322 or the cloned 
chromosomal rabbit betaglobin gene indicate that cells 
which have received and expressed the thymidine kinase gene 
30 are at least capable of carrying the above-mentioned 

exogenous DNAs, Wigler, M., et al., Cell , 16 , 777 (1979). 
Similarly, mutant "L" cells lacking both tR¥ thymidine kinase 
and adenin 6 phosphoribosyltransferase (Ltk aprt ) were trans¬ 
formed to express adenine phosphoribosyltransferase activity 
35 by unfractionated high molecular weight genomic DNA from 
Chinese hamster, human and mouse cells and by restriction 
endonuclease digested DNA from rabbit liver. However, in the 
absence of selection pressure, i.e. during growth in a medium 
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where functional adenine phosphoribosyltrahsFera'sc was not 
required, some of the transformed strains lost the ability 
to express the transforming gene. More recently, expression 
of the chicken ovalbumin gene in mouse "l" cells (Ltk“aprt~) 

5 was achieved using a plasmid (pBR322) containing both a genomic 
clone of the ovalbumin gene, and three copies of the Herpes 
simplex virus thymidine kinase gene. (Lai, E.C., et al., 

Proc. Nat. Acad. Sci. USA, 77, 244 (1980) ). In that case, 
the transformants having thymidine kinase activity 
10 also synthesized chicken ovalbumin, as identified by immuno- 
reactivity. However, the rate of ovalbumin synthesis varied 
over two orders of magnitude in independent isolates of 
transformed cells. 

The widespread use of thymidine kinase stems from the 
15 fact that it is one of the few selection systems available 
for use with cultured cells. Cells lacking thymidine kinase 
are able to grow satisfactorily in a normal growth medium 
such as Oulbecco's modified Eagle's medium. However, 
functional thymidine kinase is required if the cells are 
20 to grow in medium supplemented by hypoxanthine, aminopterin 
and thymidine, commonly termed HAT. Following transformation, 
cells able to grow in HAT medium are those which have 
acquired the thymidine kinase gene and are expressing funct¬ 
ional thymidine kinase. When it is desired to test for trans- 
25 formation by another gene for which no selection technique is 
known, the cells are co-transformed with a thymidine kinase 
gene to enable the investigator to identify transformed 
cells, by HAT selection. Generally, more extensive experi¬ 
mentation is required to demonstrate transformation by a gene 
30 not otherwise selected for. 

Expression has also been attempted using a defective 
simian virus 40 (SV40 vector) containing a portion of the 
SV40 genome including the late, promoter region. Mouse beta- 
globin 0NA was joined with the SV40 vector and introduced 
35 into monkey kidney cells. The mouse gene was expressed at • 
high frequency into mouse beta-globin. However, trans¬ 
cription appeared to have inititated at the SV40 late 
romoter region rather than the beta-globin gene promoter. 



0037723 


6 

" lf initiated from the SV40 promoter, the expression product 
may have t>een a fusion protein or may have been otherwise 
inaccurately inititated. (Hamer, D.H., et al, Natu re, 28j , 

35 (1979 ). • 

5 From the foregoing discussion of the prior art, it is 

concluded that, while expression of genomic DNA in eucaryotic 
cells is at least feasible incertain cases, little is known 
about the factors affecting the fate of the transforming 
DNA or the nature and amount of expression product formed. 

3(1 Specific results in the art remain highly unpredictable. 

The present invention demonstrates for the first time 
the expressipn of a mammalian hormone in a eucaryotic cell 
into which has been transferred a genomic clone containing 
the gene for the hormone. In particular, mouse "L" cells 
15 . have been transformed with the genomic clone of human 

• insulin and'have been shown to express a product immuno- 
logically reactive as insulin. Cells not so transformed 
fail to express the insulin product. In contrast to 
previously reported results, only a minor fraction of the 
20 cells expressing thymidine kinase were also discovered to 
express insulin. Since little is known currently of the 
biological rules governing the expression of hormone genes 
in cells not specialized for the production of that hormone, 
the isolation of clones expressing a hormone gene could 
25 not have been• predicted. There was some variation in the 
amount of insulin expressed in the three insulin-expressing 
strains. However, it was possible to observe expression at 
the level of 1 ,D 00 to 2,000 insulin molecules per cell. 

The present invention provides a genomic clone of a 
30 hormone, a plasmid transfer vector comprising said genomic 
clone, a microorgranism containing and replicating said 
transfer vector, a transformed mammalian cell line capable 
of expressing the hormone and a method of making a hormone 
utilizing such a transformed cell line. 

35 The present invention demonstrates for the first time 

the feasibility of synthesizing a polypeptide hormone in 
eucaryotic cells grown in culture and transformed by a genomic 
clone comprising the gene for said hormone. The practice 
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of the invention is illustrated by the‘ cloning of human 
insulin DNA f its characterization showing extensive inter¬ 
vening sequences, and its expression in mouse Ltk aprt” 
grown in culture. 

5 The genomic clone of the present invention and cells 

transformed thereby, capable of expressing the polypeptide 
or protein hormone and coded by said gene, are useful for 
synthesis of a polypeptide or protein hormone without resort * 
to ijn vitro processing reactions. The invention also prov- 
10 ides means for making cell lines suitable for implantation to 
supplement endogenous production of the hormone for thera¬ 
peutic purposes. For example, many diabetics and pre¬ 
diabetics have functional B cells capable of supplying part 
of the patient's insulin needs and responsive to normal 
15 metabolic control mechanisms. Implanted cells modified 
according to the present invention could provide a basal 
level of insulin for such patients, saving their endogenous 
B cells for use during times of peak insulin demand. Such 
treatment is highly advantageous since it provides the 
20 patient with adequate levels of insulin responsive to normal 
physiological controls. 

The cells used in the present experiments, Ltk - aprt" 
are derived from Ltk - clone D described by Kit, S., et al., 
supra . The cells are maintained in Dulbecco's modified 
25 Eagle's medium containing 1025 foetal calf serum, commercially 
available from Flow Laboratories, Rockville, Maryland. The 
Ltk aprt - cell strain is further described by Wigler, M. et al 
Proc. Nat. Acad. Sci. , .76, 1373 (1979). 

Restriction endonucTeases were obtained commercially 
30 from New England BioLabs, Cambridge, Massachusetts, and were 
used under reaction conditions specified in the manufacturer's 
instructions. 

The invention is next illlustrated by the specific 
examples of cloning and expression of the human insulin gene. 
35 However, the invention is not intended to be limited to these 
specific examples and must be deemed broadly applicable to 
genomic expression of highly regulated proteins generally, 
especially polypeptide hormones. 
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The invention provides a cloned DNA segment derived 
from the human genome comprising the gene for human insulin 
and being capable of being expressed in a. eucaryotic cell, 
a transfer vector comprising said cloned DNA, a micro- 
5 organism containing and replicating said transfer vector and 
a eucaryotic cell strain, transformed by the cloned gene and 
being capable of expressing said gene. The transfer vector, 
microorganism and eucaryotic cell strain have been placed on 
deposit with the American Type Culture Collection, 12301 
10 Parklawn Drive, Rockville, Maryland. The deposited transfer 
vector designated P BR322/gH112.5 as described in Example 1, 
and has ATCC accession number 40023. The microorganism £. 
coli HB101pBR322/gHI12.5, described in Example 1, has ATCC 
accession number 31605. The eucaryotic cell line, desig- 
15 nated Ltk~aprt‘gHI12.5 clone 6-4, described in Example 3, 

.was given ATCC accession number CRL-8028. 

E XAMPLE 1 - Expression of human genomic c lones 
A library of fragments of human foetal liver DNA has 
been cloned in bacteriophage lambda Charon 4A by Lawn, R.M., 
20 et al. Cell 15. 1157 (1978). A preparation of this phage was 
provided by Dr. T. Maniatis. The phages were plated on suit¬ 
able indicator bacteria as described by Cordell, B., et al., 
Cell 18, 533 (1979) at a dilution yielding 104 plaques per 
plateT - The plaques were screened for the presence of human 
25 insulin gene sequences by in situ hybridization (with radio- 
actively labelled cloned cDNA coding for human preproinsulin 
- see our co-pending European Patent Application 80303195.4 
filed on 11 September 1980). Approximately 10 plaques were 
screened in this manner, of which two were found to contain 
30 DNA which hybridized with the probe. These clones were 

purified by growth of phage stocks derived from the relevant 
plaques and their DNA was further characterized by restnc- 
ion enzyme analysis. The DNA of each of the phage stocks 
was digested with EcoRI, the digested DNA fractionated by 
35 gel electrophoresis and the fragments which contained 

insulin gene sequences were identified by hydridization 
with radiolabelled human insulin cDNA, using the method of 
Southern, F.M.. J.~ Mol. Biol ., 9B,. 503 (1975). The clones 
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were found to contain the insulin gene fragments on EcoRI 
fragments of 12.5 kb, 14.1 kg and were designated gHI 12.5 
and gHI 14.1, respectively. (Molecular size of DNA is 
conveniently measured by the length in units of 1,000 base 
5 pairs, abbreviated kb). Since the human DNA library was 
originally constructed from DNA fragments obtained by 
essentially random cleavage, the data suggested by the two 
clones represented overlapping regions of human DNA. This 
view was confirmed by further restriction mapping. For 
10 comparison purposes, a cloned cDNA fragment incorporating 
the entire human preproinsulin coding sequence is about 
0.416 kb in length. 

The 12.5 kb Eco RI fragment of gHI 12.5 was sub-cloned 
in plasmid pBR322 at the EcoRI site. The recombinant trans- 
15 fer vector was designated pBR322/gHI 12.5. The plasmid was 
replicated in E. coli HB101 transformed with b8R322/gHI 
12.5, hence designated £. coli HB101 pBR322/gHI 12.5. 

Further restriction mapping, coupled with hybridization, 
made it possible to map the position of the insulin gene on 
20 the 12.5 kb fragment. The restriction mapping experiments 
indicated that the gene was not colinear with the mature 
mRNA, since the extent of the coding regions was greater 
than the 416 base pairs present in the cloned human insulin 
cDNA used as a probe. The exact location and structure of 
25 the gene was determined by DNA sequencing, as described 
below. Figure 1 shows a restriction map of the 12.5 kb 
fragment/ showing the location of the insulin sequence and, 
in more detail, the general locations of coding sequences, 
intervening sequences and untranslated sequences in the 
30 insulin gene. It can be seen, for example, that the enzyme 
• Smal generated two fragments of about 900 snd. ; 1500 base 
pairs respectively, which hybridized with the cDNA probe. 
Since no Sma I sites were known in the cDNA sequence, it was 
concluded that the gene possesses an intervening sequence 
35 containing a Sma l site. 

EXAMPLE 2 

The human insulin gene was sequenced by the method of 
Maxam, A. and Gilbert, W., Procv. Nat.: Acad. Sci. USA , 74, 560 
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(1977). The sequence determined extended from the PvuII 
site on the 5'-side of the gene, with respect to the mRNA, 
to a point 97 base pairs beyond the termination codon at the 
-3'-end. The sequence is shown in Table 1. In addition to 
5 the nucleotide sequence of the human gene and adjacent 

regions, the translation of the coding regions into prepro¬ 
insulin is presented. The boxed sequence TATAAAG is commonly- 
found near the 5'-starting point of transcription. The 
asterisk symbolizes the putative 5'-end of the mRNA, where 
10 the characteristic capping reaction of eucaryotic mRNA 

processing occurs. The arrow indicates the 3'-end of the 
mRNA, based upon homology with the rat insulin II gene, 
Lomedico, P., et al, Cell 11. 545 (1978). The identified 
intervening sequences are contained within parentheses. 

15 Every fiftieth base is numbered, the last digit of the 
number being placed over the corresponding base. The 
entire human preproinsulin sequence is contained within 
the coding region, which in turn is interrupted by a large 
intervening sequence within the region coding for the C- 

20 peptide. 

E XAMPLE 3 - Expression of cloned human insu lin gene 
Expression of the human insulin genomic clone described 
in Example 1 was obtained by transformation of L cells in 
tissue culture. The fragment used for transformation was 
25 that designated gHI 12.5, carried on plasmid pBR322/gHI 12.5, 
replicated in E. coli HB101. The strategy of the experiment 
was amultaneously to transform the cells with a marker DNA of 
known transforming activity and with the insulin DNA. The 
insulin DNA was present in excess to increase the probability 
30 that cells transformed with marker would already have 
received the insulin gene as well. 

Ltk‘aprt~ cells used in these studies were obtained 
from Dr. Richard Axel of Columbia University. The cells 
were genetically deficient in thymidine kinase activity and 
35 in adenine phosphoribosyltransferase activity. The cells 
were cultured in DME medium supplemented with 105 (v/v) 
foetal calf serum, Dulbecco, R., Virology , £,369 (1959). 

The marker DNA codes for thymidine kinase and was derived 
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from Herpes simplex virus, and was cloned in a 3.4 kb 

BamHI fragment. 64 ng of thymidine kinase plasmid, together 

with 26 pg of insulin plasmid were used to transform 10 ** 

cells. Successful transformation was detected by the 

5 ability of cells to grow in the same growth medium, supple- 

-4 -7 

mented with 5 x 10 M hypoxanthine, 4 x 10 M aminopterin, 
and 1.6 x 10 ”^Mthymidine, hereinafter termed HAT medium. The 
genetic defect of the L cells used herein is such that their 
survival in HAT medium depends upon an active thymidine 
-jq kinase enzyme. Therefore, cells transformed with the Herpes . 
viral thymidine kinase gene are able to form colonies. The 
transformation rate in the present experiment was about 40 
colonies per 10 ^ cells. 

The transformation was carried out essentially as 
15 .described by Wigler, M., et al, Proc. NatV Acad/ Sci. USA, ' 

. 7fr,. 1373 (1979). Cells were seeded at a density of 2.5 x 10^ 
per .100 mm dish for 48 hr. grown in fresh medium for 4 hr, 
and then inoculated with the described DNA mixture. After 
48 hours' incubation at 37°C, the cells were then cultured 
20 in the HAT selection medium, which was changed every three 
days. After eighteen days, 20 HAT-resistant-colonies were 
selected. They were grown for several passages to accumulate 
material sufficient for DNA and insulin analysis. 

Transformed cells were lysed in a proteinase K, sodium 
25 dodecyl buffer as described in Wigler, M., et al., Cell , 16 , 
777 (1979). The cell lysate was extracted first with an 
equal volume of buffer-saturated phenol and second with 
chloroform-isoamyl alcohol (24:1, v/.v). High molecular 
weight DNA was isolated by mixing the aqueous phase with 
30 two volumes’of cold ethanol and immediately removing the 
precipitate that formed. The DNA was dissolved in 1.0 mM 
Tria, 0.1 mM EDTA, pH 8.0. Agarose gel electrophoresis of 
restriction endonuclease-digested DNA and hybridization with 
labelled human insulin cDNA were performed as described by 
35 Southern, above. Endogenous (mouse) insulin DNA is 

distinguishable from human since the former is cleaved 
internally by Eco RI endonuclease while the latter is not. In 
addition, under stringent hybridization conditions / the human 
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cONA probe hybridizes preferentially with the human insulin 
DNA. Human insulin coding sequences were found in all of the 
transformed cell lines tested. 

The expression of ipsulin by transformed cells was 
tested by a radioimmunoassay method described by Rail, L.8., 
et al., Endocrinology, 105 , 835 (1979). The transformed 
cells were harvested, sonicated in acid-ethanol, and lyo- 
philized, as described by deGasparo, H. et- al,, Develop. Biol., 
47, 106 (1975). The cell extracts were redissolved in 
distilled water and assayed as described.. Three of 20 thymi¬ 
dine kinase-positive clones were found to produce insulin, or 
an insulin-related molecule immunologically reactive with 
insulin antibody. Estimates of the amount of insulin made 
were based upon reactivity of known quantities of pork 
insulin in the assay. The most active cell line, designated 
Ltk"aprt~gHI 12.5 clone 6-4, produced about 1500 insulin 
molecules per cell. The fact that most transformed clones 
did not produce detectable insulin was in marked contrast to 

results reported by Lai, H.C. et al., Proc. Nat. Acad._Sci. 

USA, 77244 (1980) and suggest that factors necessary for 
obtaining expression of particular genes in eucaryotic cells 
are not well defined. 

Figure 2 shows the radioimmunoassay results for the 
three insulin-producing clones. The results are plotted as 
the percent of•radioactive standard displaced as a function 
of the amount of insulin in the sample. The solid line is a 
standard curve based on porcine insulin. The arrows indicate 
results obtained with three different clones, at the indic¬ 
ated dilutions.. Assuming that the activity per molecule of 
the synthesized product is equivalent to that of pork 
insulin, the transformed cells were calculated to produce 
about 1,000 molecules per cell of the expression product. 
Control cells containing no human insulin sequences did not 
produce this product. Based upon its immunochemical 
reactivity and its expression in response to the presence of 
the human insulin gene, the expression product is considered 
to be identical with or closely related to human insulin. 
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EXAMPLE U 

The expression- of a human insulin genomic clone was 
obtained using an SV60 cloning vector. For the purpose of 
this experiment, the 12.5 kb fragment of human DNA containing 
the insulin gene was cleaved with restriction' enzymes Xho l 
in B^m. Seven DNA fragments were generated, and the 3 kb 
fragment containing the insulin gene was fractionated and 
purified by preparative gel electrophoresis. The 3 kb 
fragment was treated with Si nuclease and DNA polymerase 'I 
to provide blunt ends, to which were attached Hin dlll 
oligonucleotide linkers. The fragment was then sub-cloned 
at the Hin diII site of pBR322. 

SV AO DNA, 100 mg. was digested with Hin di 11 and Taq I 
endonucleases to yield several fragments, one of which 
contains genetic determinants for the origin of SV AO DNA 
replication and the promoter for late SV AO genes, Laub et 
al. , Virology 92 , 310 (1979). This fragment, termed the SV 
AO 0/p fragment^ was purified by preparative gel electro¬ 
phoresis, and was ^.milarly cloned at the Hin dllI site of 
pBR 322. 

Purified sub-cloned insulin fragment, 3yg, was mixed 
with a two-fold molar excess of purified sub-cloned SVAO 0/P 
fragment in a DNA ligase catalyzed joining reaction. The 
reaction products were fractionated according to size, and 
circular DNA composed of two repeats of the SVAO 0/P fragment 
covalently linked to an insulin DNA segment was selected. 

This DNA was designated SVAO/insulin DNA. 

The replication of SVAO/insulin recombinant is 
achieved by cotransfection of monkey cells with-wild-type 
SVAO DNA, as described. Forty-eight hours post-cotransfection, 
viral DNA is extracted from infected cells by the Hirt pro¬ 
cedure (Hirt, J. Hoi. Biol ., 26 , 365 (1965) •). The procedure 
removes high molecular weight DNA, leaving SVAO wild-type 
and SVAO/insulin DNA in the supernatant fraction. The DNA is 
fractionated by gel electrophoresis, both before and after 
treatment with endonuclease Eco RI. Wild-type SVAO DNA has an 
Eco RI cleavage site which is lacking in the SVAO/insulin DNA. 
Similarly, the two can be distinguished on cesium chloride- 
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ethidium bromide gradients. A stock of SV40/insulin DNA is 
prepared From an Eco RI digest followed by fractionation and 
selection of closed circular DNA. 

Monkey cells are transfected by DNA as described by 
5 Mertz and Berg, Virology , 62 ,. 112 (1974). The' cells are 
mixedly transfected with 3yg SV40/insulin DNA and 1 pg: 

SV40 wild-type DNA. The cells are then examined 48 hours 
after transfection. 

Transcription expression of the human insulin gene in 
10 cotransfected cells is detected by RNA blotting techniques. 
RNA is isolated separately from nuclear and cytoplasmic 
fractions of the co-infected cells. The RNA was denatured 
in glyoxal, fractionated by electrophoresis and analyzed for 
the presence of human insulin coding sequences by the method 
15 of Alwine, J.C., et al., Proc. Nat/ Acad. SciV USA , 74' ,’ 5350 
(1977). Using labelled insulin cDNA as a probe, RNA 
corresponding to the insulin gene sequence is detected in 
co-transfected cells. 

For the purpose of detecting the production of insulin, 
20 the radioimmunoassay described in Example 3 is employed. 

In addition, monkey cells transfected with the SV4Q/insulin 
DNA were labelled with 35S-methionine for 46 to 48 hours 
post-transfection.. Nuclear and cytoplasmic fractions are 
separated and the polypeptides are analyzed by sodium 
25 dodecylsulfate polyacrylamide gel electrophoresis. Indivi¬ 
dual protein bands are visualized by autoradiography. An 
additional band of protein the size of proinsulin is observed 
in the cytoplasmic fraction of cells transfected with the 
SV40/insulin DNA and in the incubation medium, but not in 
30 cells transfected with wild-type SV40, or non-transfected 

cells. These results are consistent with the conclusion that 
the SV40/insulin DNA serves as a transfer vector for the 
cloned human insulin gene and that expression of the gene in 
susceptible host cells can occur. 
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CAGCTGTGASCAGCCACAGG7CTCGCCACCGGGCCC7TGC77AAGACTCTAATGACCCGCTCCTCCTGACS 

100 

AACAGGTCCCCACCACCAAGCAGATCTTCCCACACACCCAGCACCAGCCAAATCCCTCCCCAAA7TGCACC 

150 200 
CTCAGCCCCCAGCCA7C7GCCGACCCCCCCACCCCGCCCTAATGGGCCAGCCCCCAGCCGTTGACAGGTAG 

2 50 • 

GGCAGA7GCCCTCTCASAC G5S5S5! CC AGCGGGGGCCC ACC AGCCCTC AGCCCTCCACCACACGC7G 

300 350 

CA7CACAAGAGGCCATCAACCAG (GTCTGTTCCAAGGGCCTT7CCCTCAGCTGCGCTCAGGGT7CCACGC 

400 _ 

TCSCTGGACCCCAGGCCCCAGCTCTGCAGCACGGAGGACGTGGCTCCGCTCCTCAAGCATGTGCCGGTCAS 

<50 

CCCAGCCGCCCCAAGGCACGGCACC7CCCC7TCACCCTCCC7CAGCCCTGCC7CTC7CCCAG) ATCACTG 

500 Met Ala Leu Trp Met Arg Leu Leu Pro Leu Leu Ala Leu Leu Ala 

TCC7TCTGCC ATG CCC CTG TGG ATG CGC CTC CTG CCC CTG C7C GCG CTG CTG GCC 

Leu Trp Cly Pro Aso Pro Ala Ala Ala Phe Val Asn Gin His Leu Cys Gly Ser 

CTC TGG GCA CCT CAC CCA GCC GCA GCC TTT GTG AAC CAA CAC CTG TCC CGC TCA 

His Leu Val Glu Ale Leu Tyr Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr 

CAC CTG GTG GAA GCT CTC TAC CTA CTG TGC GCG GAA CGA CGC TTC TTC TAC AC A 

Pro Lys Thr Arg Arg Glu Ala Glu As? Leu Gin V 700 

CCC AAC ACC CGC CCC GAG GCA GAG GAC CTG CAG G . (GTGAGCCAACCCCCCATTGCTGC 

750 

CCC7CGCCGCCCCCAGCCACCCCCTCC7CCTCGCGCTCCCACCCACCATGCCCACAACCCGGCAGGAGCCT 

800 850 

GCCACCCAGCAGGCGGTCAGGTGCACTTTTTTAAAAAGAAGTTCTCTTGGTCACGTCCTAAAACTGACCAG 

■ 900 

CTCCCTGTGGCCCAGTCAGAATCTCACCCTCAGGACCCTGTTGCCTTCGCCACCCCCGACATACATCACAC 

950 

GCTGC GC ACCC TCC TCCC TCC ACTCG CCCC TCAAAC AA ATCCCCCGC AGCCC ATTTCTCCACCCTC ATTTG 
1000 1050 

ATGACCGCAGA7TCAAGTGTTT7GT7AAGTAAAGTCCTGGGTCACCAGGGGTCACAGCGTGCCCCACGCTG 

* 1100 

CCTGCCTCTGGGCGAACACCCCATCACGCCCCCACCAGGGCGTGCCTGCCTGCCTGAGCGGCCCACACCCC 

1150 1200 
TGTCGCCAGCCTCACGCCACCTCCATAGTCAGGAGATCGGCAACATGCTGGCGACAGGCCCTGCCCACAAG 

12 50 

TAC TCGGATCACCTG TTC ACGC TCCC ACTGTGACGCTGCCCCGGGGCCGGGGAACGAGCTCGGACATGTGG 

1300 - _ : 

CCCTTGGGGCCTGTAGGTCCACACCCAGTGTGGGTCACCCTCCCTCTAACCTGGCTCCACCCCCGCTGGAG | 

1400 * 

ATCGCTCGCACTCCGACCTACGGCTCCCGGGC4CCCGCGCACTCTCTCTCCCTCACTCTGTCCTCCTCTCT J 

1450 al Cly Cln Val 

CCCTC7GCCTCGCCGCTCTTCCCCAACCTGCTCTCCGCGGCACG7CCTGGCAC) TG CGG CAC CTC 

Glu Leu Cly Gly Cly Pro Gly Ala Gly Ser Leu Gin Pro Leu Ala Leu Glu Gly 

CAG CTG CGC CGC GCC CCT GGT CCA CGC ACC CTC CAC CCC TTC CCC CTC CAG CCC 

Ser Leu Cln Lys Arg Gly He Val Clu Cln Cys Cys Thr Ser He Cys. Ser Leu ! 

TCC CTC CAG AAG CCT CCC ATT CTG CAA CAA TCC TCT ACC ACC ATC TCC TCC CTC j 

Tyr Gin Leu Clu Asn Tyr Cys Asn AH 1450 

TAC CAC CTC CAC AAC TAC TCC AAC TAC ACCC ACCCTCC ACCC ACCC CCACACCCCCCCCCT 

j 1700 _ 

CCTCCACCCACACACATCCAATAAACCCCTTCAACCACC CCTGCTCTGCCCTTCTCTCTCTGCCCCCCCT 

1750 

CCCCAACCCCCACTTCCCCCCACT C T X G 7CACCCCCTCCCACCTCTCTCC/.CCCTC TCTCCC7GCC 
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CLAIMS 

1. A cloned DNA segment derived from the human genome, 
comprising the gene for a polypeptide hormone, capable of 
being expressed in a eucaryotic cell* 

2. The cloned DNA segment of claim 1, wherein the 
polypeptide hormone is insulin. 

3 . The cloned DNA segment of claim 1, designated gHl 
12.5. 

A. The cloned DNA segment of claim 1, designated gHI 14.1. 

5. A DNA transfer vector comprising the cloned DNA 
segment of claim 1• 

6. The DNA transfer vector of claim 5, wherein the cloned 
DNA segment comprises the human insulin gene. 

7. The DNA transfer vector of claim 6, comprising pBR322 
with gHI 12.5 inserted at the Eco RI site thereof. 

8. . A microorganism containing and replicating the DNA 
transfer vector of claim 1. 

9. A microorganism according to claim 8, wherein the 
transfer vector comprises a cloned DNA segment comprising 
the human insulin gene. 

10. The microorganism according to claim 9, wherein the 
transfer vector comprises the plasmid pBR322 with gHI 12.5 
inserted at the Eco RI site thereof. 

11. The microorganism of claim 10, comprising £. coli RRI. 

12. A eucaryotic cell strain, grown in culture, trans¬ 
formed by the cloned DNA segment of claim 1, said cell 
strain capable of expressing said gene. 

13. The cell strain of claim 12, wherein the cloned DNA 
segment comprises the gene for insulin. 

14. The cell strain of claim 12 or 13, comprising mouse 
Ltk~aprt~ cells. 

15. The cell strain of claim 14, wherein the cloned DNA 
segment is selected from the group comprising gHI 12.5 and 
gHI 14.5. 

16. A cell culture comprising the eucaryotic cell strain 
of claim 12 and a growth medium suitable for growth and 
replication of said cell strain. 

17. A cell culture according to claim 16, wherein said 
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cell strain is transformed by a cloned DNA segment 
comprising the gene for insulin. 

18. The cell culture of claim 16 or 17, comprising mouse 
Ltk~aprt cells. 

19. A process for making human insulin comprising the 
steps of: 

..(a) preparing the cloned DNA segment of claim 2, 
3 or 4 containing the human insulin gene; 

(b) transforming a eucaryotic cell strain grown 
in culture with said cloned DNA segment; 

(c) selecting transformed cells capable of 

expressing said human insulin gene; 

(d) making an extract of said cells; and 

(e) purifying insulin from said extract. 

20. The DNA transfer Vector, of claim 6, comprising a 
portion of the SV40 genome and a portion of gHI 12.5 
containing the insulin gene. 
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CLAIMS 

1.' A process for preparing a DNA transfer vector for 
replicating a ONA segment containing a gene coding for a 
human polypeptide hormone comprising isolating a DNA seg¬ 
ment containing said gene and inserting said segment into a 
DNA transfer vector characterized in that said DNA segment is 
derived from the human genome and said gene is capable of 
being expressed in a eucaryotic cell. 

2. The process of claim 1, wherein the polypeptide hor¬ 
mone is insulin. 

3. The process of claim 1, wherein the DNA segment is 
designated gHI 12.5. 

A. The process of claim 1, wherein the DNA segment is 

designated gHI 14.5. 

5. The process of claim 3, wherein gHI 12.5 is inserted 
at the EcoRI site of pBR323. 

6. The process of claim 3, wherein the DNA transfer 
vector comprises a portion of the SV40 genome. 

7. A process for preparing, a microorganism containing 

and replicating a DNA transfer vector comprising transforming 
a microorganism with said transfer vector characterized in 
that said transfer vector is prepared according to any of 
claims 1 to 6. 

8. The process of claim 7 wherein said microorganism is 

£• coli RRI• 

9V A process for preparing a eucaryotic cell strain, 
grown in culture, containing a DNA transfer vector and 
capable of expressing said transfer vector comprising trans¬ 
forming a microorganism with said transfer vector, 
characterized in that said transfer vector is prepared 
according to any of claims 1 to 6. 

10. The process of claim 9 wherein said cell strain 
comprises mouse Ltk aprt cell9. 

11V A process for preparing a cell culture comprising a 
cell strain and a growth medium suitable for growth and 
replication of said cell strain characterized in that said 
cell strain is prepared according to claim 9 or 10, 

12. A process for preparing human insulin comprising: 
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(a) preparing a transfer vector containing a 
gene for human insulin prepared according to any of claims 
2 to 6; 

(b) transforming a eucaryotic cell strairr grown 
in culture with said transfer vector; 

(c) selecting transformed cells capable of 
expressing said insulin gene; 

(d) making an extract of said cell's, and 

(e) purifying human insulin from said extract. 
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